Databases of Emotional Speech

Author

  • Nick Campbell
Abstract

This paper presents a personal view of some of the problems facing speech technologists in the study of emotional speech. It describes some databases that are currently being used, and points out that the majority of them use actors to reproduce the emotions, thereby possibly misrepresenting the true characteristics of emotion in speech. Databases of real emotional speech, on the other hand, present serious ethical and moral problems, since the nature of their contents must, by definition, reveal personal and intimate details about the speakers.

1. OBJECTIVES OF THE PAPER

This paper does not set out to provide an inventory of databases available for the study of emotional speech characteristics; the JASA paper by Murray and Arnott [1], the PHYSTA home page [2], and the web pages of Erlangen University and the Salk Institute [3], for example, provide good overviews of such previous work. Instead, the paper presents a personal account of some of the problems facing researchers who wish to study the speech characteristics associated with different emotions. It approaches the issue from the standpoint of speech technology, rather than that of psychology, and describes work planned under a forthcoming JST-funded five-year project for the study of ‘expressive speech phenomena’, which will include the production of a large-scale emotional-speech database. Rather than present new facts or data, the paper sets out some topics for discussion and raises questions, in the hope that some of the issues may be resolved during the three days of the workshop.

2. EXPRESSIVE SPEECH

Linguistic information is all that can be carried by text, but it is only a small part of the spoken message. As humans, when listening to speech, we are sensitive to extra-linguistic information about the identity and the state of the speaker, as well as to paralinguistic information about the speaker’s intentions underlying the utterance. This information is largely missing from computer speech synthesis, and current speech recognition systems make no use of it. In many instances of conversational human communication, the speaker’s intention, signalled by the manner of speech, is as important as the text of the utterance, and in social or phatic communication, often more so. As humans, we have become used to processing such extra-verbal information and will presumably expect it when interacting with machines through the medium of voice.

Similar resources

Building a naturalistic emotional speech corpus by retrieving expressive behaviors from existing speech corpora

A key element in affective computing is to have large corpora of genuine emotional samples collected during natural conversations. Recording natural interactions through telephone is an appealing approach to build emotional databases. However, collecting real conversational data with expressive reactions is a challenging task, especially if the recordings are to be shared with the community (e....

Emotional Aspects of Intrinsic Speech Variabilities in Automatic Speech Recognition

We analyze two German databases: the OLLO database [1] designed for doing speech recognition experiments on speech variabilities, and the Berlin emotional database [2] designed for the analysis and synthesis of emotional speech. The paper tries to find a relation between intrinsic speech variabilities and the emotions. Moreover, we study this relation from the point of view of speech recognitio...

Stages and Method of Preparing Syllable and Diphone Speech Databases for a Persian Text-to-Speech System

Speech databases are part of concatenative text-to-speech synthesis systems. The phonetic quality of the databases plays a significant role in the naturalness of the synthesized speech. This paper introduces two syllable and diphone speech databases for Persian and examines how they were developed, their specifications, and their relative advantages. ...

On the Relationship between Emotional Intelligence and Directive Speech Acts Preference

Language and emotion are two related systems in use, in that one system (emotions) impacts the performance of the other (language). Both of them share their functionality in communication. Since the nature of foreign language classrooms is ideally interactional, emotional intelligence (EI) gains importance. The aim of this study was to find out whether one's total emotional quotient and its com...

A Comparative Study of the Various Emotional Speech Databases

Speech emotional database and recognition is the challenging part of human computer interaction. The current research focuses towards the detection of emotion in various situations, while the database demands more to fetch out the work of recognition. The study investigates the various existing speech databases containing various basic emotions, enhancing the appropriate database development as...

Emotional speech: Towards a new generation of databases

Research on speech and emotion is moving from a period of exploratory research into one where there is a prospect of substantial applications, notably in human–computer interaction. Progress in the area relies heavily on the development of appropriate databases. This paper addresses four main issues that need to be considered in developing databases of emotional speech: scope, naturalness, cont...

Publication date: 2000